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Abstract. We consider n by n real matrices whose entries are non-degenerate 
random variables that are independent but non necessarily identically dis- 
tributed, and show that the probability that such a matrix is singular is 
0(l/^/n). The purpose of this note is to provide a short and elementary 
proof of this fact using a Bernoulli decomposition of arbitrary non degenerate 
random variables. 



1. Introduction 

Let M n = {flij) be a random n x n matrix, where the independent (non 

necessarily identically distributed) real random variables. We assume that the r.v. 
a,ij satisfy the following uniform non-degeneracy property 

(H) There exists p G]0, |[ such that for any i,j — 1, ■ • • , n, P(a,j > x^-) > p 

and ¥(an < x~A > p for some real numbers xj- < xj- . 
We provide an elementary proof of the following proposition. 

Proposition 1. Let M n be an n x n matrix whose coefficients are independent 
random variables satisfying (H). Then P(M n is singular) — 0(l/y / n). 

The study of the singularity of random matrices goes back, at least, to Komlos 
who showed in [Kol] that P(M„ is singular) = o(l) for independent and identically 
distributed (iid) Bernoulli entries, namely =0,1 with probability 1/2. Using 
Sperner's Lemma, Komlos noticed that the probability was 

C(n- 1/2 ) [b] j a result 

which has been further extended in [Si] to the case of iid entries equally distributed 
over a finite set. For iid Bernoulli entries, the conjecture is that P(M„ is singular) = 
(c + o(l))™ with c = h. Such an exponential behaviour has been successively 
obtained and improved in [KKoSl ITVTl ITV2] up to c = | . The value c=\ still 
seems to be out of reach. 

If one turns to general entries, Komlos proved in |Ko2j that P(M„ is singular) = 
o(l) for independent and identically distributed non degenerate random variables. 
Furthermore, as pointed out by Tao and Vu in [TVli Section 8], it follows from 
their analysis that P(M„ is singular) = o(l) for independent non degenerate entries, 
provided Property (H) holds. Under the same hypothesis PropositionQ] asserts that 
P(M n is singular) = O^ 1 / 2 ). 

But the main purpose of this note is to illustrate how the Bernoulli decomposition 
developed in |AGKW| may be used in order to extend results known for Bernoulli to 
the general case of independent non degenerate random variables. We perform that 
illustration by extending Komlos's argument as reproduced in [B], to independent 
random variables satisfying the Property (H). It is however not clear, at least to the 
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authors, whether results in |TVll [TV2j . and in particular Halasz-type arguments, 
could be extended in a similar way. 

2. Proof 

Our approach relies on the following lemma which is essentially contained in 
AGKW . For the reader's convenience we sketch its proof in the appendix. 

Lemma 2. Let M n be an n x n matrix whose coefficients are independent random 
variables satisfying (H). We can decompose the entries of the matrix M n as follows: 
For all there exist two independent random variables Wij and and functions 
fij :]0, 1[— ► K and % :]0, l[->]0, +oo[ such that 

1. 6ij is a Bernoulli random variable with parameter p^ g]0, 1[; 

2. uiij has the uniform distribution in ]0, 1[; 

3. ay = fij(v)ij) + 5ij(wij)eij. 

Moreover, p^ £]l — Pa,Pa[ for all where po = 1 — p. 

Remark 3. It is of crucial importance for us (see (6p in the proof of Lemma\B() 
that Sij > 0. We however do not need here a uniform bound from below on these 
Sij. In some situations, one actually does need such a uniform lower bound (see 
[AGKW] ), in which case it is sufficient to modify (H) above and require the existence 
of X- < x + independent of 

Thanks to Lemma[2]and since Wq and are independent r.v., we may adopt the 
following strategy to prove the proposition: 1. do the conditioning with respect to 
the variables Wy, so that, given the Wy's, M n becomes a sum of a constant matrix 
and of a random matrix with Bernoulli entries with probabilities (1 — Pij, Pij) and 
amplitudes dij(wij); 2. estimate, with respect to the Bernoulli variables e^, the 
probability that M n is singular following the strategy of [Bj; 3. take the expectation 
value with respect to the variables Wq. 

We shall denote by P^™) the conditional probability with respect to the w%j 
variables, i.e. pM(-) := F(-\{wij} id ). 

Following [B] , we introduce the strong rank of a system of vectors S = {vi , • ■ ■ , v n }, 
sr(S), to be the largest integer k such that any k of the Vj's are linearly indepen- 
dent. For an m by n matrix A, we denote by sr c (A) and sr r (^4) to be, respectively, 
the strong rank of the system of columns and of rows of A. 

The first ingredient of the proof is the following upper bound on the probability 
for an m by n matrix to have a "not too large" strong rank. 

Lemma 4. Let A be an m by n random matrix whose coefficients ay satisfy (H) . 
and w = (wij) be given. Then 

/ \ m— fe+1 / \ n— fc+1 

¥^(sr c {A) <k)<( I ^ and ¥^ w \sr r (A) < k) < ^ — . 

\ K J 1 — p \ K J 1 — p 

(1) 

Proof. The second statement is clearly equivalent to the first one (applied to A T ). 
By definition of the strong rank, sr c (A) < k if and only if there exists k columns 
of A which are linearly dependant. It thus suffices to show that for any 1 < i\ < 
■ • ■ < ik < n, 

m-fe+l 

pWlrankK,- ,v ik } < k) < ^ , (2) 

1 - Po 
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where the Vj denote the columns of A. Now we have 

pW(rankK,--- ,v ik } < k) (3) 
fc-i 

< PMK=0)+^PW(«, +1 GSpanK,-.. , ^.}|rankK, • • • ,v ij }=j). 

Let B denote the m by j matrix whose columns are the vectors , ■ ■ ■ , Vi j . If 

C 



B has rank j, without loss of generality we may decompose it as B = 



D 



where C is an invertible j by j matrix. In a similar way, let us decompose v%, +1 as 
Y , 

where Y has length j and Z length m — j. Note that for w given in 



Z 

the Bernoulli decomposition, the probability of each entry of Z taking a particular 
value is bounded by po . 

Then, Vi +1 G Spanjw^ , • • • , } iff there exists a vector u = (ui , • • • ,Uj) T such 
that Bu = Vi ,, and hence iff Cu = Y and Du = Z. But since C is invertible we 
finally get Vi j+1 € Spanjw^ , • • • , } iff Z = DC~ Y Y . Therefore we have 

P w (^ + i e Span{^ 17 • • • .WiJIrank^,- • • ,v i} } = j) (4) 
= E^ (P(-)(Z = DC^YIY)) < EP(p™- j ) = p™-\ (5) 

where denotes the conditional expectation with respect to the variables w and 
over the random vector Y. Inserting (TJ| into ([3]) and noting that P^'tv^ = 0) < 
p^, this proves ©. □ 
The second ingredient of the proof is the following improvement of ([2]) . 

Lemma 5. Let vi, • • • ,Vk € (k < n) be linearly independent and X — [a\, ■ ■ ■ ,a n ) 
a random vector whose coefficients satisfy (H). Suppose that sr r (^4) = s where A 
is the matrix whose columns are the vj 's. Then P^(X £ Span{«i, ■■■ ,v k }) < 

The above lemma relies on the following generalization of the Littlewood and Of- 
ford problem to the case of non necessarily identically distributed r.v. and which is 
an immediate consequence of an extended version of Sperner's lemma (see [AG KW , 
Lemma 3.2). 

Lemma 6. If a\, ■ ■ ■ , a s are non zero real numbers, fcgR and e%, ■ ■ ■ , e s indepen- 
dent Bernoulli random variables with parameter pi s]l — Po,Po[> then 

P(aid + ■ • ■ + a s e s = b) = 0(1/ i/s). 

Proof of Lemma[5j Let B denote the (n by k + 1) matrix A augmented with the 
column vector X, and let n, • • • , r n denote the rows of B. If X G Spanjui, • • ■ , v k } 
then B has rank k, so that without loss of generality we may assume that r\ , ■ ■ ■ , Tk 
are linearly independent, and that the others Vj depend on these. In particular 

fe+i 



^2 jidi = o, 
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where jk+i = 1 and, because sr r (B) > sr s (A) = s, at least s of the others 7, are 
non-zero. Thus, using Lemma[6j we have, recalling S(wi) > 0, 

/fe+i \ /fc+i fe+i \ 

^ = = P(U ° E 7i*(^)c< = - E ^/K) < C/a/^TT. (6) 

\i=l / \i=l i=l / 

Finally, in the same way as in the proof of (U), the for k + 2 < i < n are uniquely 
determined by a\, ■ ■ ■ , a^, and thus each of them has a probability at most pa to 
take a particular value. □ 
Proof of Proposition fTJ By Lemma [2] we have 

P(rank(M„) < n) = E {WijKj (p {w) (rank(M„) < n 

Let 0</3<a<ltobe specified. Let C\, ■ ■ ■ ,C n denote the column vectors of 
M n and write Ek for the event that Ci, • • ■ , Ck are linearly independent and Cfc+i 
depends on them. We then have 

n-l 

P M (rank(M„) < n) < P w (sr c (M„) < an) + P W (£fc). 

fc— an 

Indeed, if an < sr c (Af n ) < n there exists k > an such that Ci, • • • , C'k are inde- 
pendent but Cfc+i does depend on them. 

Fix now an < k < n, and denote by Ak the n by k matrix whose columns are 
C\ , ■ ■ ■ ,Ck- We have then 

V (w) (Ek) < P {w) (sr r (Ak) < (3n)+P {w) (Ek\sr r (Ak) > (3n). 

Using Lemmas 0] and we thus get 

P (u,) (rank(M„) < n) 

n(l-a) + l n-1 / , s. fc-/3n+l 



< 



< 



1 - Po ^ \ \ fin J l-po 

k—an x 



PO f n \ (l-a)n , PO / n A ( Q -/3)n 



(1-P0) 2 1 r 1 (i-po)V^ 



< C" ( e ™C 1 («) + ( 1 - Q ) ln Po) + e n(h(/3) + (a-/5)lnp ) + _L 

where = —x ln(x) — (1 — x) ln(l — a;) is the entropy function and we made used 
of the Stirling formula to get the last line. It finally suffices to take < < a < 1 
small enough so that h(a) + (1 — a) lnpo and h((3) + (a — (3) \npo are both strictly 
negative. □ 



3. Appendix 

For the reader's convenience, we recall the basic material from AG KWj and 
show how to extract from (H) the desired uniform estimates on the Bernoulli de- 
composition. Namely, we prove Lemma [21 

Let a be a random variable satisfying the estimates of Property (H) , with points 
x- < x+. We denote by /j, its law and by F its distribution function: F(u) — 
u(] — 00, it]). We set, for any t g]0, 1[, 

G{t) := m{{u,F(u) > t}. (7) 
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Note that 

G(t) < u <s=>> F(u) > t. (8) 
We set p_ = — oo, [), p + = p{\x + , +oo[), and p = 1 — p_. Thanks to (H), 
p > p + > p, and 1 — p = p_ > p, so that p €]p, 1 — p[. 

Following the idea of [AGKW1 Proof of Theorem 2.1], define for t e]0, 1[: 

*i(t) := G((l-p)t) 

r 2 (t) := z_-G(p_) + G(l-p+pf). (9) 

We always have G(p_) < x_, but these two numbers may differ. Note that since 
p(] — oo,a;_[) = p-, if it turns out that G(p_) < X-, then for any u G]G(p_),x_[ 
one has p_ < F(u) < p_, and thus p(]G(p-),x-{) = (this is the reason for the 
definition of Y2 which differs from the one of AGKW, Proof of Theorem 2.1]). We 
then let 

/(*) := Yi(t), (10) 
S(t) := Y 2 (t)-Y 1 (t), (11) 

so that, if e is a Bernoulli variable with probabilities (1 — p,p) and t a random 
variable with uniform distribution in ]0, 1[, we do have 

a = f(t)+S(t)e. (12) 

It remains to prove that S(t) > almost surely. Obviously, combining ([9]) and (fTT|) . 
S(t) > a;_ — G(p_) for any t, so that if x~ > G(p_) we are actually done. 
Suppose G(p_) = X-. We claim that T\ = 1 > T 2 where 

Ti = ia£{t e]0, 1[: = G(p_)} (arrival time of Yi) , 

T 2 = sup{t G]0, 1[ : Y 2 (t) = G(l - p + 0)} (departure time of Y" 2 ) ■ 

As in [AGKWl Proof of Theorem 2.1]) the latter then implies that 5(t) > for 
all t. It is easy to see that T 2 < (1 — p_ — p + )/(l — p_) < 1. It remains to 
show that T\ = 1. Suppose T\ < 1. For any t G]Ti,l[ and for any u < one 
has x_ = G{p-t) > u. Then ([8} implies that F(u) < p_i, and thus we get the 
following contradiction 

p_ = /i(] — 00, = sup F(u) < p-t < p_. (13) 
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